Text to Phoneme Conversion in Persian Using Smooth Ergodic Hidden Markov Model

Author

F. Hendessi

Abstract

In developing a text-to-speech system, it is well known that the accuracy of the information extracted from the text is crucial to producing high-quality synthesized speech. This paper studies a Persian text-to-speech system. The system uses speech waveform concatenation, a comparatively mature method in text-to-speech synthesis. The paper describes the innovation introduced into the system's text analyzer module, which uses a probabilistic model together with a database for text-to-phoneme conversion. We call this probabilistic model the Smooth Ergodic Hidden Markov Model (SEHMM) and show that it is an effective choice for text-to-speech applications.
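The abstract does not give the details of SEHMM, so the following is only a minimal sketch of how a generic hidden Markov model can map a letter sequence to a phoneme sequence with Viterbi decoding. It is not the authors' method. The function names `viterbi` and `to_log`, the placeholder phonemes "a" and "b", the placeholder letters "x" and "y", and all probabilities are assumptions made up for illustration. Every transition probability is nonzero, which is the sense of "ergodic" assumed here.

```python
import math

def to_log(table):
    """Convert a nested dict of probabilities into log-probabilities."""
    return {k: {j: math.log(p) for j, p in row.items()} for k, row in table.items()}

def viterbi(observations, states, log_start, log_trans, log_emit):
    """Return the most likely hidden-state (phoneme) sequence for the letters."""
    # best[t][s]: best log-probability of any path that ends in state s at step t
    best = [{s: log_start[s] + log_emit[s][observations[0]] for s in states}]
    back = []
    for obs in observations[1:]:
        scores, pointers = {}, {}
        for s in states:
            # Pick the predecessor that maximizes the path score into s.
            prev = max(states, key=lambda p: best[-1][p] + log_trans[p][s])
            scores[s] = best[-1][prev] + log_trans[prev][s] + log_emit[s][obs]
            pointers[s] = prev
        best.append(scores)
        back.append(pointers)
    # Trace back from the best final state.
    path = [max(states, key=lambda s: best[-1][s])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]

# Hypothetical toy model: hidden states are phonemes, observations are letters.
STATES = ["a", "b"]
LOG_START = {s: math.log(p) for s, p in {"a": 0.6, "b": 0.4}.items()}
LOG_TRANS = to_log({"a": {"a": 0.7, "b": 0.3}, "b": {"a": 0.4, "b": 0.6}})
LOG_EMIT = to_log({"a": {"x": 0.9, "y": 0.1}, "b": {"x": 0.2, "y": 0.8}})

print(viterbi(["x", "y", "y"], STATES, LOG_START, LOG_TRANS, LOG_EMIT))
```

In a real system the tables would be estimated from a pronunciation database rather than written by hand, and log-probabilities are used here only to avoid numerical underflow on long inputs.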


Similar resources

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...


Application of Smooth Ergodic Hidden Markov Model in Text to Speech Systems

In developing a text-to-speech system, it is well known that the accuracy of information extracted from a text is crucial to produce high quality synthesized speech. In this paper, a new scheme for converting text into its equivalent phonetic spelling is introduced and developed. This method is applicable to many applications in text to speech converting systems and has many advantages over oth...


Letter-to-Phoneme Conversion for a German Text-to-Speech System

This thesis deals with the conversion from letters to phonemes, syllabification and word stress assignment for a German text-to-speech system. In the first part of the thesis (chapter 5), several alternative approaches for morphological segmentation are analysed and the benefit of such a morphological preprocessing component is evaluated with respect to the grapheme-to-phoneme conversion algori...


Grapheme-to-phoneme conversion for Chinese text-to-speech

This paper reports a study of grapheme-to-phoneme (G2P) conversion for Chinese text-to-speech (TTS) system. As Chinese is a syllabic language, syllable is commonly adopted as the phonetic unit in TTS, which is represented by pinyin, the standard Chinese romanization. A Chinese G2P conversion is to find correct pinyin for polyphonic graphemes in the input text. In this paper, a complete G2P fram...


A hidden Markov model for Persian part-of-speech tagging

Part-of-speech tagging is one of the important tasks in language processing. Despite this importance, and although numerous models have been presented for different languages, little work has been done for Persian. In this paper, a part-of-speech tagging system for a Persian corpus using a hidden Markov model is proposed. To achieve this goal, the main aspects of Pers...



Publication date: 2004
